SCORE Search Results Details for Application 10529592 and Search Result 20090427_122917_us-10-529-592a- 



IIB 

iiiiiiii^^ 


DRE Search Results Details for Application 10529592 and 
earch Result 20090427 1 2291 7 _us-1 0-529-592a-1 .rst. 


Score Home 
F-acie 


Retrieve Appiication 
List 


SCORE System 
Overview 


SCORE 
FAQ 


Comments / 





This page gives you Search Results detail for the Application 10529592 and Search Result 
20090427_1 2291 7_us-1 0-529-592a-1 .rst. 



Go Back to previous page 



GenCore version 6.3 
Copyright (c) 1993 - 2009 Biocceleration Ltd. 



OM nucleic - nucleic search, using sw model 



April 28, 2009, 04:16:35 ; Search time 1875 Seconds 
(without alignments) 
45457.283 Million cell updates/sec 



Title: 

Perfect score: 
Sequence : 



US-10-529-592A-1 

881 

1 gggccatgacccccgctgct . 



. aaataaagatcctctgtaac 881 



Scoring table: 



IDENTITY_NUC 

Gapop 10.0 , Gapext 1.0 



Searched: 83780570 seqs, 48372533981 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 
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Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 

Result Query 





No. 


Score 


Match 


Length 


DB 


ID 


Description 




1 


710.6 


80 


7 


979 


5 


BQ672221 


BQ672221 


AGENCOURT 




2 


703 


79 


8 


916 


5 


BU527159 


BU527159 


AGENCOURT 




3 


685.6 


77 


8 


735 


4 


BI914593 


BI914593 


603179545 




4 


650 . 4 


73 


8 


889 


5 


BQ671560 


BQ671560 


AGENCOURT 




5 


563 


63 


9 


586 


3 


BE875115 


BE875115 


601485988 


c 


6 


549 


62 


3 


687 


4 


BM977476 


BM977476 


UI-CF-ENl 


c 


7 


547.4 


62 


1 


677 


4 


BQ045152 


BQ045152 


UI-CF-ENl 




8 


545.6 


61 


9 


840 


4 


BI546970 


BI546970 


603190185 




9 


545 . 4 


61 


9 


838 


7 


CF993841 


CF993841 


AGENCOURT 


c 


10 


543 . 8 


61 


7 


600 


1 


AA760709 


AA760709 


nzl3cll . s 


c 


11 


539.8 


61 


3 


573 


4 


BQ025351 


BQ025351 


UI-l-BBlp 


c 


12 


539.8 


61 


3 


581 


4 


BQ025360 


BQ025360 


UI-l-BBlp 




13 


536 . 4 


60 


9 


591 


4 


BM711587 


BM711587 


UI-E-CLl- 


c 


14 


536 


60 


8 


539 


1 


AI973037 


AI973037 


wr46g08 .x 




15 


530.2 


60 


2 


843 


7 


CB993691 


CB993691 


AGENCOURT 




16 


510.2 


57 


9 


577 


5 


BQ425608 


BQ425608 


AGENCOURT 


c 


17 


501 . 4 


56 


9 


541 


4 


BQ025279 


BQ025279 


UI-l-BBlp 




18 


501.2 


56 


9 


542 


4 


BM688675 


BM688675 


UI-E-CQl- 


c 


19 


491 


55 


7 


495 


2 


AW072412 


AW072412 


xa07f 07.x 


c 


20 


486.4 


55 


2 


491 


1 


AI969836 


AI969836 


wq75f01.x 




21 


474 


53 


8 


474 


4 


BM725994 


BM725994 


UI-E-EJO- 




22 


472.4 


53 


6 


788 


7 


CD101714 


CD101714 


AGENCOURT 



http://es/ScoreAccessWeb/GetItem.action?AppId=1052959...27_122917_us-10-529-592a-l.rst&ItemType=4&startByte=0 (2 of 35)5/19/2009 9:51:31 AM 



SCORE Search Results Details for Application 10529592 and Search Result 20090427_122917_us-10-529-592a-l.rst 





23 


460 . 6 


52 . 


,3 


593 


18 


W56389 


W56389 


zc57b01 .rl 


c 


24 


453 


51 . 


. 4 


456 


1 


AI824062 


AI824062 


wj35fl0.x 


c 


25 


452 . 4 


51 . 


. 4 


469 


4 


BM681000 


BM681000 


UI-E-EJO- 


c 


26 


447 . 4 


50 . 


. 8 


460 


1 


AI201581 


AI201581 


qc02g09 .X 




27 


438 . 4 


49 . 


. 8 


443 


8 


CN366258 


CN366258 


170006008 


c 


28 


434 . 8 


49 . 


, 4 


456 


5 


BU727045 


BU727045 


UI-E-CQl- 




29 


433 


49 , 


. 1 


447 


7 


CD674248 


CD674248 


f s08d06 .y 


c 


30 


417 . 6 


47 . 


. 4 


439 


2 


AW779356 


AW779356 


hn80b03 .X 




31 


417 


47 . 


.3 


626 


4 


BM688212 


BM6 8 8212 


UI-E-CLO- 




32 


416 . 2 


47 . 


. 2 


803 


7 


CB998149 


CB998149 


AGENCOURT 


c 


33 


404 . 8 


45 . 


. 9 


412 


1 


AI042370 


AI042370 


ox62b07 .X 


c 


34 


404 


45 , 


. 9 


429 


1 


AA827147 


AA827147 


ob53c05 . s 




35 


397 


45 , 


. 1 


398 


5 


BX096382 


BX096382 


BX096382 




36 


383 . 6 


43 . 


, 5 


413 


1 


AA490346 


AA490346 


aa44a05 .r 


c 


37 


380 


43 . 


, 1 


381 


1 


AI221443 


AI221443 


qg75c09 . X 




38 


372 . 8 


42 , 


.3 


616 


3 


BG285710 


BG285710 


602380756 




39 


367 


41 , 


, 7 


574 


18 


W37398 


W37398 


zcllalO .rl 


c 


40 


360 


40 , 


. 9 


365 


2 


AW419070 


AW419070 


hal7cll .X 


c 


41 


357 . 4 


40 . 


. 6 


361 


2 


AW439942 


AW439942 


hallell .X 


c 


42 


356 . 4 


40 . 


, 5 


384 


5 


BU726946 


BU726946 


UI-E-CLO- 


c 


43 


351 . 4 


39 . 


. 9 


366 


1 


AI216467 


AI216467 


qh07h08 .x 


c 


44 


351 . 2 


39 , 


. 9 


386 


18 


N98734 


N98734 


zb87f 07 . si 




45 


338.4 


38. 


. 4 


1094 


4 


BM925906 


BM925906 


AGENCOURT 



ALIGNMENTS 



RESULT 1 
BQ672221 
LOCUS 

DEFINITION 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



BQ672221 979 bp rtiRNA linear EST 15-JUL-2002 

AGENCOURT_8303632 NIH_MGC_102 Homo sapiens cDNA clone IMAGE : 6274816 
5 ' , mRNA sequence . 
BQ672221 

BQ672221.1 GI:21783055 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

Mammalia; Eutheria; Euarchontoglires ; Primates; Haplorrhini; 

Catarrhini; Hominidae; Homo. 

1 (bases 1 to 979) 

NIH-MGC http://mgc.nci.nih.gov/. 

National Institutes of Health, Mammalian Gene Collection (MGC) 

Unpublished (1999) 

Contact: Robert Strausberg, Ph.D. 
Email: cgapbs-r@mail.nih.gov 
Tissue Procurement: ATCC 
cDNA Library Preparation: Rubin Laboratory 
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cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 
DNA Sequencing by: Agencourt Bioscience Corporation 
Clone distribution: MGC clone distribution information can be 
found through the I.M.A.G.E. Consortium/LLNL at: 
http : //image . llnl . gov 
Plate: LLCM2456 row: p column: 17 
High quality sequence stop: 550. 
FEATURES Location/Qualifiers 
source 1. .979 

/organism="Homo sapiens" 
/mol_type=" mRNA " 
/db_xref="taxon: 96 06" 
/clone=" IMAGE: 6274816" 

/tissue_type="epidermoid carcinoma, cell line" 
/lab_host="DH10B (phage-resistant ) " 
/clone_lib="NIH_MGC_102" 

/note="Organ : salivary gland; Vector: pOTB7; Site_l: Xhol; 
Site_2: EcoRI; cDNA made by oligo-dT priming. 
Directionally cloned into EcoRI/XhoI sites using the 
following 5' adaptor: GGCACGAG(G), Library constructed 
by Ling Hong in the laboratory of Gerald M, Rubin 
(University of California, Berkeley) using ZAP-cDNA 
synthesis kit (Stratagene) and Superscript II RT (Life 
Technologies). Note: this is a NIH_MGC Library." 



Query Match 80 . 7%; 

Best Local Similarity 97.3%; 
Matches 786; Conservative 



Score 710.6; I 
Pred. No. 1 . 7e- 
0; Mismatches 



B 5; 
163; 
15; 



Length 9 79; 



Indels 



Gaps 



Qy 



1 GGGCCATGACCCCCGCTGCTCTGTCTTGCAGGCTCGTCGCCGCGGCCCCCCGAGCCCGAC 6 0 
I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
23 GGGCCATGACCCCCGCTGCTCTGTCTTGCAGGCTCGTCGCCGCGGCCCCCCGAGCCCGAC 82 



Qy 
Db 
Qy 

Db 
Qy 
Db 
Qy 
Db 



6 1 CGCCGCCGCCACCACCACCAGCGCCCGGGCGGGCCTCGCGCGCCTCGGGCGCGGCTCCGC 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

83 CGCCGCCGCCACCACCACCAGCGCCCGGGCGGGCCTCGCGCGCCTCGGGCGCGGCTCCGC 142 

121 AGTGAGCCCACCAAGAAGGAAGCGGCCTGCAGAGGTGCCGACATGGGGCTTAAGATGTCC 180 

I I I I I I I I I I I I I I I I I I I I I I I I ! I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

143 AGTGAGCCCACCAAGAAGGAAGCGGCCTGCAGAGGTGCCGACATGGGGCTTAAGATGTCC 2 02 

181 TGCCTGAAAGGCTTTCAAATGTGTGTCAGCAGCAGCAGCAGCAGCCACGACGAGGCCCCC 2 4 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

2 03 TGCCTGAAAGGCTTTCAAATGTGTGTCAGCAGCAGCAGCAGCAGCCACGACGAGGCCCCC 26 2 

241 GTCCTGAACGACAAGCACCTGGACGTGCCCGACATCATCATCACGCCCCCCACCCCCACG 30 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

263 GTCCTGAACGACAAGCACCTGGACGTGCCCGACATCATCATCACGCCCCCCACCCCCACG 322 
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Qy 301 GGCATGATGCTGCCGAGGGACTTGGGGAGCACAGTCTGGCTGGATGAGACAGGGTCGTGC 36 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 323 GGCATGATGCTGCCGAGGGACTTGGGGAGCACAGTCTGGCTGGATGAGACAGGGTCGTGC 3 82 

Qy 361 CCAGATGATGGAGAAATCGACCCAGAAGCCTGAGGAGGTGTCCTGGGTTTGGCTGGCTGG 42 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 383 CCAGATGATGGAGAAATCGACCCAGAAGCCTGAGGAGGTGTCCTGGGTTTGGCTGGCTGG 4 42 

Qy 421 CTCCTGCTCCAGCGGCCCGGCTTCAGGTGTCCGGGGGCGTGGCTGCCTGGAGCAGGTGTG 480 

I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 443 CTCCTGCTCCAGCGGCCCGGCTTCAGGTGTCCGGGGGCGTGGCTGCCTGGAGCAGGTGTG 502 

Qy 481 CTGAATACCCTGGATGGGAACTGAGCGAA-CCCGGGCCTCCGCTCAGAGAGACGTGGCAG 53 9 

I I I I I I I I I I I I I I I I I I I I I M I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 503 CTGAATACCCTGGATGGGAACTGAGCGAACCCCGGGCCTCCGCTCAGAGAGACGTGGCAG 56 2 

Qy 540 GACCAGCGAGGAATCCAGCCTGTCCA-CTTCCAGAACAGTGTTTCCCAGGCCCCGCTGAG 598 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 563 GACCAGCGAGGAATCCAGCCTGTCCACCTTCCAGAACAGTGTTTCCCAGGCCCCGCTGAG 6 22 

Qy 599 TGGACCGGACCTCTGACACCTCCAGGTTCTTGCTGACTCCGGCCTGGTGAAAGGGAGCGC 658 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 623 TGGACCGGACCTCTGACACCTCCAGGTTCTTGCTGACTCCGGCCTGNTGAAAGGGAGCGC 6 82 

Qy 65 9 CATGGTCCTGGCTGTTGGGGTCCCAGGGAGAGGCTCT-CTTCTGGACAAACACACCC-TC 716 

I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
Db 6 83 CATGGTCCTGGCTGGTGGGGTCCCAGGGAGAGGCTCTCCTTCTGGACAAACACACCCTTC 7 42 

Qy 717 CCAGCCCCCAGGGCTGTGCAAACACATG-CCCCTGCCATAAGCACCAACAAGAAC— TTC 7 73 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I III I 
Db 743 CCAGCCCCCAGGGCTGTGCAAACACATGCCCCCGGCCATAAGCCCCCACAAAAACCTTCT 8 02 

Qy 774 TTGCAGGTGGAGTGGCTGTTTTTTATAA 801 

I I I I I I I I I I I I I I I I I I I I I 
Db 803 TGGCCGGGGGAAGGGCTTTTTTTTTTAA 830 



RESULT 2 
BU527159 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



BU527159 916 bp mRNA linear EST 13-SEP-20 

AGENCOURT_1015 48 78 NIH_MGC_101 Homo sapiens cDNA clone 
IMAGE : 6 536 921 5', mRNA sequence. 

BU527159 

BU527159.1 GI:22837600 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
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Mammalia; Eutheria; Euarchontoglires ; Primates; Haplorrhini; 
Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 916) 

AUTHORS NIH-MGC http://mgc.nci.nih.gov/, 

TITLE National Institutes of Health, Mammalian Gene Collection (MGC) 

JOURNAL Unpublished (1999) 
COMMENT Contact: Robert Strausberg, Ph.D. 

Email: cgapbs-r@mail.nih.gov 
Tissue Procurement: ATCC 
cDNA Library Preparation: Rubin Laboratory 
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 
DNA Sequencing by: Agencourt Bioscience Corporation 
Clone distribution: MGC clone distribution information can be 
found through the I.M.A.G.E. Consortium/LLNL at: 
http : / / image . llnl . gov 
Plate: LLCM2699 row: a column: 17 
High quality sequence stop: 653. 
FEATURES Location/Qualifiers 
source 1. .916 

/organism="Homo sapiens" 
/mo l_t ype= " mRNA " 
/db_xref ="taxon : 96 06 " 
/clone=" IMAGE: 6536 921" 

/tissue_type="epidermoid carcinoma, cell line" 
/lab_host="DH10B (phage-resistant ) " 
/clone_lib="NIH_MGC_101" 

/note="Organ : lung; Vector: pOTB7; Site_l: EcoRI; Site_2 : 
Xhol; cDNA made by oligo-dT priming. Direct ionally cloned 
into EcoRI/XhoI sites using the following 5' adaptor: 
GGCACGAG(G). Library constructed by Ling Hong in the 
laboratory of Gerald M. Rubin (University of California, 
Berkeley) using ZAP-cDNA synthesis kit (Stratagene) and 
Superscript II RT (Life Technologies). Note: this is a 
NIH_MGC Library." 



Query Match 79 . 8 

Best Local Similarity 96.9 
Matches 749; Conservative 



Score 703; DB 5; Length 916; 
Pred. No. 1.2e-161; 
0; Mismatches 20; Indels 



4; Gaps 



Qy 



1 GGGCCATGACCCCCGCTGCTCTGTCTTGCAGGCTCGTCGCCGCGGCCCCCCGAGCCCGAC 6 0 

I I I I I I I I I I I I I I I I I I I I I I j I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I 
3 GGGCCATGACCCCCGCTGCTCTGTCTTGCAGGCTCGTCGCCGCGGCCCCCCGAGCCCGAC 6 2 



Qy 



6 1 CGCCGCCGCCACCACCACCAGCGCCCGGGCGGGCCTCGCGCGCCTCGGGCGCGGCTCCGC 12 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I 
6 3 CGCCGCCGCCACCACCACCAGCGCCCGGGCGGGCCTCGCGCGCCTCGGGCGCGGCTCCGC 122 



Qy 



121 AGTGAGCCCACCAAGAAGGAAGCGGCCTGCAGAGGTGCCGACATGGGGCTTAAGATGTCC 18 0 
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Db 123 AGTGAGCCCACCAAGAAGGAAGCGGCCTGCAGAGGTGCCGACATGGGGCTTAAGATGTCC 182 

Qy 181 TGCCTGAAAGGCTTTCAAATGTGTGTCAGCAGCAGCAGCAGCAGCCACGACGAGGCCCCC 2 40 

I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 183 TGCCTGAAAGGCTTTCAAATGTGTGTCAGCAGCAGCAGCAGCAGCCACGACGAGGCCCCC 2 42 

Qy 241 GTCCTGAACGACAAGCACCTGGACGTGCCCGACATCATCATCACGCCCCCCACCCCCACG 30 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 243 GTCCTGAACGACAAGCACCTGGACGTGCCCGACATCATCATCACGCCCCCCACCCCCACG 3 02 

Qy 301 GGCATGATGCTGCCGAGGGACTTGGGGAGCACAGTCTGGCTGGATGAGACAGGGTCGTGC 36 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 303 GGCATGATGCTGCCGAGGGACTTGGGGAGCACAGTCTGGCTGGATGAGACAGGGTCGTGC 36 2 

Qy 361 CCAGATGATGGAGAAATCGACCCAGAAGCCTGAGGAGGTGTCCTGGGTTTGGCTGGCTGG 42 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 363 CCAGATGATGGAGAAATCGACCCAGAAGCCTGAGGAGGTGTCCTGGGTTTGGCTGGCTGG 422 

Qy 421 CTCCTGCTCCAGCGGCCCGGCTTCAGGTGTCCGGGGGCGTGGCTGCCTGGAGCAGGTGTG 480 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 423 CTCCTGCTCCAGCGGCCCGGCTTCAGGTGTCCGGGGGCGTGGCTGCCTGGAGCAGGTGTG 482 

Qy 481 CTGAATACCCTGGATGGGAACTGAGCGAACCCGGGCCTCCGCTCAGAGAGACGTGGCAGG 5 40 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 483 CTGAATACCCTGGATGGGAACTGAGCGAACCCGGGCCTCCGCTCAGAGAGACGTGGCAGG 5 42 

Qy 541 ACCAGCGAGGAATCCAGCCTGTCCACTTCCAGAACAGTGTTTCCCAGGCCCCGCTGAGTG 6 00 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 543 ACCAGCGAGGAATCCAGCCTGTCCACTTCCAGAACAGTGTTTCCCAGGCCCCGCTGAGTG 6 02 

Qy 601 GACCGGACCTCTGACACCTCCAGGTTCTTGCTGACTCCGGCCTGGTGAAAGGGAGCGCCA 66 0 

M I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 603 GACCGGACCTCTGACACCTCCAGGTTCTTGCTGACTCCGGCCTGGTGAAAGGGAGCGCCA 662 

Qy 661 TGGTCCTGGC — TGTTGGGGTCCCAGGGAGAGGCTCTCTTCTGGACAAACACACCCT-CC 717 

I I I I I I I I I I II III II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
Db 663 TGGTCCTGGCCTGGTGGGGTCCCAGGGAAGAGGCTCTCTTCTGGACAAACACACCCTCCC 722 

Qy 718 CAGCCCCCAGGGCTGTG-CAAACACATGCCCCTGCCATAAGCACCAACAAGAA 76 9 

I II II II II II II II II I II II II II II II I I I I I I I I I 

Db 723 CAGCCCCCAGGGCTGTGCCAAACCCATGCCCCCTGCCAATAAGGCACCAACAA 7 75 



RESULT 3 
BI914593 

LOCUS BI914593 735 bp mRNA linear EST 16-OCT-2001 

DEFINITION 603179545F1 NIH_MGC_121 Homo sapiens cDNA clone IMAGE : 5243626 5', 
mRNA sequence . 



http://es/ScoreAccessWeb/GetItem.action?AppId=1052959...27_122917_us-10-529-592a-l.rst&ItemType=4&startByte=0 (7 of 35)5/19/2009 9:51:32 AM 



SCORE Search Results Details for Application 10529592 and Search Result 20090427_122917_us-10-529-592a-l.rst 



ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



FEATURES 

source 



BI914593 

BI914593.1 61:16178714 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Euarchontoglires ; Primates; Haplorrhini; 
Catarrhini; Hominidae; Homo. 

1 (bases 1 to 735) 

NIH-MGC http://mgc.nci.nih.gov/. 

National Institutes of Health, Mammalian Gene Collection (MGC) 
Unpublished (1999) 

Contact: Robert Strausberg, Ph.D. 

Email: cgapbs-r@mail.nih.gov 

Tissue Procurement: Life Technologies, Inc. 

cDNA Library Preparation: Life Technologies, Inc. 

cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 

DNA Sequencing by: Incyte Genomics, Inc. 

Clone distribution: MGC clone distribution information can be 
found through the I.M.A.G.E. Consortium/LLNL at: 
http : //image . llnl . gov 

Plate: LLAM11614 row: j column: 11 
High quality sequence stop: 723. 

Location/Qualifiers 

1. .735 

/organism="Homo sapiens" 
/mol_type="mRNA" 
/db_xref="taxon: 96 06" 
/clone="IMAGE: 5243626" 
/lab_host="DH10B" 
/clone_lib="NIH_MGC_121" 

/note="Organ: brain; Vector: pCMV-SP0RT6; Site_l : Not I; 
Site_2: EcoRV (destroyed); RNA source anonymous pool of 3 
fetal brains, female age 20 weeks, female age 24 weeks, 
and male age 26 weeks. Library is oligo-dT primed and 
directionally cloned (EcoRV site is destroyed upon 
cloning). Average insert size 1.7 kb, insert size range 
0.7-3.5 kb. Library is normalized and enriched for 
full-length clones and was constructed by C. Gruber 
( Invitrogen) . Research Genetics tracking code 017. Note: 
this is a NIH_MGC Library." 



Query Match 77.8%; Score 685.6; DB 4 

Best Local Similarity 96.6%; Pred. No. 2.3e-157 
Matches 730; Conservative 0; Mismatches 4 



Length 735; 

Indels 22; Gaps 2; 



Qy 



119 GCAGTGAGCCCACCAAGAAGGAAGCGGCCTGCAGAGGTGCCGACATGGGGCTTAAGATGT 178 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
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Db 1 GCAGTGAGCCCACCAAGAAGGAAGCGGCCTGCAGAGGTGCCGACATGGGGCTTAAGATGT 6 0 



Qy 179 CCTGCCTGAAAGGCTTTCAAATGTGTGTCAGCAGCAGCAGCAGCAGCCACGACGAGGCCC 23 8 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 61 CCTGCCTGAAA TGCAGCAGCAGCAGCCACGACGAGGCCC 99 

Qy 239 CCGTCCTGAACGACAAGCACCTGGACGTGCCCGACATCATCATCACGCCCCCCACCCCCA 298 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M M I I I I I I I I 

Db 10 0 CCGTCCTGAACGACAAGCACCTGGACGTGCCCGACATCATCATCACGCCCCCCACCCCCA 15 9 

Qy 29 9 CGGGCATGATGCTGCCGAGGGACTTGGGGAGCACAGTCTGGCTGGATGAGACAGGGTCGT 35 8 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 16 0 CGGGCATGATGCTGCCGAGGGACTTGGGGAGCACAGTCTGGCTGGATGAGACAGGGTCGT 219 

Qy 359 GCCCAGATGATGGAGAAATCGACCCAGAAGCCTGAGGAGGTGTCCTGGGTTTGGCTGGCT 418 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 220 GCCCAGATGATGGAGAAATCGACCCAGAAGCCTGAGGAGGTGTCCTGGGTTTGGCTGGCT 2 79 

Qy 419 GGCTCCTGCTCCAGCGGCCCGGCTTCAGGTGTCCGGGGGCGTGGCTGCCTGGAGCAGGTG 4 78 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 28 0 GGCTCCTGCTCCAGCGGCCCGGCTTCAGGTGTCCGGGGGCGTGGCTGCCTGGAGCAGGTG 33 9 

Qy 479 TGCTGAATACCCTGGATGGGAACTGAGCGAACCCGGGCCTCCGCTCAGAGAGACGTGGCA 53 8 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 340 TGCTGAATACCCTGGATGGGAACTGAGCGAACCCGGGCCTCCGCTCAGAGAGACGTGGCA 399 

Qy 539 GGACCAGCGAGGAATCCAGCCTGTCCACTTCCAGAACAGTGTTTCCCAGGCCCCGCTGAG 598 

I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 400 GGACCAGCGAGGAATCCAGCCTGTCCACTTCCAGAACAGTGTTTCCCAGGCCCCGCTGAG 45 9 

Qy 59 9 TGGACCGGACCTCTGACACCTCCAGGTTCTTGCTGACTCCGGCCTGGTGAAAGGGAGCGC 658 

I I I I I I I I I I I I I I I I I I I I I I M M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 46 0 TGGACCGGACCTCTGACACCTCCAGGTTCTTGCTGACTCCGGCCTGGTGAAAGGGAGCGC 519 

Qy 659 CATGGTCCTGGCTGTTGGGGTCCCAGGGAGAGGCTCTCTTCTGGACAAACACACCCTCCC 718 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 520 CATGGTCCTGGCTGTTGGGGTCCCAGGCAGAGGCTCTCTTCTGGACAAACACACCCTCCC 5 79 

Qy 719 AG-CCCCCAGGGCTGTGCAAACACATGCCCCTGCCATAAGCACCAACAAGAACTTCTTGC 777 

II I I I I I I I I I I I I I I I I M I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 58 0 AGCCCCCCAGGGCTGTGCAAACACATGCCCCTCCCATAAGCACCAACAAGAACTTCTTGC 63 9 

Qy 778 AGGTGGAGTGGCTGTTTTTTATAAGTTGTTTTACAGATACGGAAACAGTCCAAAATGGGA 83 7 

I I I I I I I I I I I I I I I I I I I I I II I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 6 40 AGGTGGAGTGGCTGTATTTTATAAGTTGTTTTACAGATACGGAAACAGTCCAAAATGGGA 6 99 

Qy 838 TTTATAATTTCTTTTTTGCATTATAAATAAAGATCC 873 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 700 TTTATAATTTCTTTTTTGCATTATTyy^TAAAGATCC 735 
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RESULT 4 

BQ671560 

LOCUS BQ671560 889 bp mRNA linear EST 15-JUL-2002 

DEFINITION AGENCOURT_8171941 NIH_MGC_102 Homo sapiens cDNA clone IMAGE : 6277428 

5 ' , mRNA sequence . 
ACCESSION BQ671560 

VERSION BQ671560.1 GI:21782394 

KEYWORDS EST. 

SOURCE Homo sapiens (human) 

ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Euarchontoglires ; Primates; Haplorrhini; 
Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 889) 

AUTHORS NIH-MGC http://mgc.nci.nih.gov/. 

TITLE National Institutes of Health, Mammalian Gene Collection (MGC) 

JOURNAL Unpublished (1999) 
COMMENT Contact: Robert Strausberg, Ph.D. 

Email: cgapbs-r@mail.nih.gov 
Tissue Procurement: ATCC 
cDNA Library Preparation: Rubin Laboratory 
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 
DNA Sequencing by: Agencourt Bioscience Corporation 
Clone distribution: MGC clone distribution information can be 
found through the I.M.A.G.E. Consortium/LLNL at: 
http : //image . llnl . gov 
Plate: LLCM2463 row: m column: 13 
High quality sequence stop: 549. 
FEATURES Location/Qualifiers 
source 1. .889 

/organism="Homo sapiens" 
/mol_type=" mRNA " 
/db_xref="taxon: 96 06" 
/clone=" IMAGE: 6277428" 

/tissue_tYpe="epidermoid carcinoma, cell line" 
/lab_host="DH10B (phage-resistant ) " 
/clone_lib="NIH_MGC_102" 

/note="Organ : salivary gland; Vector: pOTB7; Site_l : Xhol; 
Site_2: EcoRI; cDNA made by oligo-dT priming. 
Directionally cloned into EcoRI/XhoI sites using the 
following 5' adaptor: GGCACGAG(G). Library constructed 
by Ling Hong in the laboratory of Gerald M. Rubin 
(University of California, Berkeley) using ZAP-cDNA 
synthesis kit (Stratagene) and Superscript II RT (Life 
Technologies). Note: this is a NIH_MGC Library." 
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Query Match 73.8%; Score 65 0.4; DB 5; Length 889; 

Best Local Similarity 93.1%; Pred. No. l.le-148; 

Matches 715; Conservative 0; Mismatches 46; Indels 7; Gaps 3; 

Qy 1 GGGCCATGACCCCCGCTGCTCTGTCTTGCAGGCTCGTCGCCGCGGCCCCCCGAGCCCGAC 6 0 

I I M M M M I I M I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I 
Db 23 GGGCCATGACCCCCGCTGCTCTGTCTTGCAGGCTCGTCGCCGCGGCCCCCCGAGCCCGAC 82 

Qy 61 CGCCGCCGCCACCACCACCAGCGCCCGGGCGGGCCTCGCGCGCCTCGGGCGCGGCTCCGC 12 0 

I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 83 CGCCGCCGCCACCACCACCAGCGCCCGGGCGGGCCTCGCGCGCCTCGGGCGCGGCTCCGC 142 

Qy 121 AGTGAGCCCACCAAGAAGGAAGCGGCCTGCAGAGGTGCCGACATGGGGCTTAAGATGTCC 18 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 143 AGTGAGCCCACCAAGAAGGAAGCGGCCTGCAGAGGTGCCGACATGGGGCTTAAGATGTCC 202 

Qy 181 TGCCTGAAAGGCTTTCAAATGTGTGTCAGCAGCAGCAGCAGCAGCCACGACGAGGCCCCC 2 40 

I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 203 TGCCTGAAAGGCTTTCAAATGTGTGTCAGCAGCAGCAGCAGCAGCCACGACGAGGCCCCC 26 2 

Qy 241 GTCCTGAACGACAAGCACCTGGACGTGCCCGACATCATCATCACGCCCCCCACCCCCACG 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 263 GTCCTGAACGACAAGCACCTGGACGTGCCCGACATCATCATCACGCCCCCCACCCCCACG 322 

Qy 301 GGCATGATGCTGCCGAGGGACTTGGGGAGCACAGTCTGGCTGGATGAGACAGGGTCGTGC 36 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 323 GGCATGATGCTGCCGAGGGACTTGGGGAGCACAGTCTGGCTGGATGAGACAGGGTCGTGC 3 82 

Qy 361 CCAGATGATGGAGAAATCGACCCAGAAGCCTGAGGAGGTGTCCTGGGTTTGGCTGGCTGG 42 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 383 CCAGATGATGGAGAAATCGACCCAGAAGCCTGAGGAGGTGTCCTGGGTTTGGCTGGCTGG 4 42 

Qy 421 CTCCTGCTCCAGCGGCCCGGCTTCAGGTGTCCGGGGGCGTGGCTGCCTGGAGCAGGTGTG 48 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 443 CTCCTGCTCCAGCGGCCCGGCTTCAGGTGTCCGGGGGCGTGGCTGCCTGGAGCAGGTGTG 5 02 

Qy 481 CTGAATACCCTGGATGGGAACTGAGCGAACCCGGGCCTCCGCTCAGAGAGACGTGGCAGG 5 40 

M M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I 
Db 503 CTGAATACCCTGGATGGGAACTGAGCGAACCCGGGCCTCCGCTCAGAGAGACGTGGCAGG 56 2 

Qy 541 ACCAGCGAGGAATCCAGCCTGTCCACTTCCAGAACAGTGTTTCCCAGG-CCCCGCTGAGT 599 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 563 ACCAGCGAGGAATCCAGCCTGTCCACTTCCAGAACAGTGTTTCCCAGGCCCCCGCTGAGG 622 

Qy 600 GGACCGGACCTCTGACACCTCCAGGTTCTTGCTGACTCCGGCCTGGTGAAAGGGAGCGCC 659 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 623 GGACCGGACCTCTGACACCTCCCGGTTCTTGCTGACTCCGGCCTGGGGAAAGGGAACCGC 6 82 

Qy 66 0 ATGGTCC— TGGCTGTTGGGGTCCCAGGGAGAGGCTCTCTTCTGGACATyVCACACCCTCC 717 
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I I I I I I I I I I I I I I I I I I I I I I I I I I III II I I I I 

Db 683 TTGGTCCCTGGCTGTTGGGGGGCCCAGGGAAAGGGTCGTCTTCGGAGCAAACGAACCTTT 742 

Qy 718 CA GCCCCCAGGGCTGTGCAAACACATGCCCCTGCCATAAGCACC 761 

I I I I I I I I II I I I I I I I I I I I I I I I I I 

Db 743 GAAAGCCCCCCCGGGCATGGACAAAACAAGCCCCCGGTCCTATGGACC 790 



RESULT 5 
BE875115 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



FEATURES 

source 



BE875115 586 bp mRNA linear EST 20-OCT-2000 

601485988F1 NIH_MGC_69 Homo sapiens cDNA clone IMAGE : 3888621 5', 
mRNA sequence . 
BE875115 

BE875115.1 GI:10323891 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

Mammalia; Eutheria; Euarchontoglires ; Primates; Haplorrhini; 

Catarrhini; Hominidae; Homo. 

1 (bases 1 to 586) 

NIH-MGC http://mgc.nci.nih.gov/. 

National Institutes of Health, Mammalian Gene Collection (MGC) 
Unpublished (1999) 

Contact: Robert Strausberg, Ph.D. 
Email: cgapbs-r@mail.nih.gov 
Tissue Procurement: DCTD/DTP/Gazdar 

cDNA Library Preparation: Life Technologies, Inc. 

cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 

DNA Sequencing by: Incyte Genomics, Inc. 

Clone distribution: MGC clone distribution information can be 
found through the I.M.A.G.E. Consortium/LLNL at: 
http : //image . llnl . gov 
Plate: LLAM9668 row: o column: 22 
High quality sequence stop: 586. 
Location/Qualifiers 
1. .586 

/organism="Homo sapiens" 
/mol_type="mRNA" 
/db_xref="t axon: 96 06" 
/clone=" IMAGE: 3 888621" 

/tissue_type=" large cell carcinoma, undifferentiated" 
/lab_host="DH10B (phage-resistant ) " 
/clone_lib="NIH_MGC_6 9" 

/note="Organ: lung; Vector: pCMV-SP0RT6; Site_l: Not I; 
Site_2: Sail; Cloned unidirectionally . Primer: Oligo dT . 
Average insert size 1.1 kb. Library constructed by Life 
Technologies . " 
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Query Match 63.9%; Score 563; DB 3; Length 586; 

Best Local Similarity 99.0%; Pred. No. 3.4e-127; 

Matches 5 77; Conservative 0; Mismatches 5; Indels 1; Gaps 1; 

Qy 300 GGGCATGATGCTGCCGAGGGACTTGGGGAGCACAGTCTGGCTGGATGAGACAGGGTCGTG 35 9 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1 GGGCATGATGCTGCCGAGGGACTTGGGGAGCACAGTCTGGCTGGATGAGACAGGGTCGTG 6 0 

Qy 36 0 CCCAGATGATGGAGAAATCGACCCAGAAGCCTGAGGAGGTGTCCTGGGTTTGGCTGGCTG 419 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 CCCAGATGATGGAGAAATCGACCCAGAAGCCTGAGGAGGTGTCCTGGGTTTGGCTGGCTG 12 0 

Qy 420 GCTCCTGCTCCAGCGGCCCGGCTTCAGGTGTCCGGGGGCGTGGCTGCCTGGAGCAGGTGT 479 

M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 GCTCCTGCTCCAGCGGCCCGGCTTCAGGTGTCCGGGGGCGTGGCTGCCTGGAGCAGGTGT 180 

Qy 48 0 GCTGAATACCCTGGATGGGAACTGAGCGAACCCGGGCCTCCGCTCAGAGAGACGTGGCAG 53 9 

I I I I I I I I M I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 GCTGAATACCCTGGATGGGAACTGAGCGAACCCGGGCCTCCGCTCAGAGAGACGTGGCAG 2 40 

Qy 540 GACCAGCGAGGAATCCAGCCTGTCCACTTCCAGAACAGTGTTTCCCAGGCCCCGCTGAGT 599 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 GACCAGCGAGGAATCCAGCCTGTCCACTTCCAGAACAGTGTTTCCCAGGCCCCGCTGAGT 300 

Qy 600 GGACCGGACCTCTGACACCTCCAGGTTCTTGCTGACTCCGGCCTGGTGAAAGGGAGCGCC 659 

I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 GGACCGGACCTCTGACACCTCCAGGTTCTTGCTGACTCCGGCCTGGTGAAAGGGAGCGCC 36 0 

Qy 66 0 ATGGTCCTGGCTGTTGGGGTCCCAGGGAGAGGCTCTCTTCTGGACAAACACACCCTCCCA 719 

I I I I I I I I I I I I I I I I I I I I I I M M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 361 ATGGTCCTGGCTGTTGGGGTCCCAGGGAGAGGCTCTCTTCTGGACAAACACACCCTCCCA 42 0 

Qy 720 GCCCCCAGGGCTGTGCAAACACATGCCCCTGCCATAAGCACCAACAAGAACTTCTTGCAG 779 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 421 GCCCCCAGGGCTGTGCAAACACATGCCCCTCCCATAAGCACCAACAAGAACTTCTTGCAG 48 0 

Qy 780 GTGGAGTGGCTGTTTTTTATAAGTTGTTTTACAGATACGGAAACAGTCCAAAATGGGATT 839 

I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 481 GTGGAGTGGCTGTTTTTTATAAGTCGTTTTACAGATACGGAAACAGTCCAAAATGGGATT 5 40 

Qy 840 TATAATTTCTTTTTTGCATTA-TAAATAAAGATCCTCTGTAAC 881 

I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I 
Db 541 TATAATTTCTTTTTTGCATTACTCACTCAAGATCCTCTGTAAC 583 



RESULT 6 
BM977476/C 
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LOCUS BM977476 687 bp rtiRNA linear EST 21-FEB-2003 

DEFINITION UI-CF-ENl-aee-b-14-O-UI.sl UI-CF-ENl Homo sapiens cDNA clone 
UI-CF-ENl-aee-b-14-O-UI 3', mRNA sequence. 

ACCESSION BM977476 

VERSION BM977476.1 GI:19595931 

KEYWORDS EST. 

SOURCE Homo sapiens (human) 

ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Euarchontoglires ; Primates; Haplorrhini; 
Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 687) 

AUTHORS Bonaldo,M.F. , Lennon,G. and Scares, M.B. 

TITLE Normalization and subtraction: two approaches to facilitate gene 

discovery 

JOURNAL Genome Res. 6 (9), 791-806 (1996) 
PUBMED 8889548 
COMMENT Contact: McCray, PB 

McCray Lab 
University of Iowa 

2024 University of Iowa Med Labs, Iowa City, lA 52242, USA 

Tel: 319 356 4866 

Fax: 319 356 7171 

Email: paul-mccray@uiowa.edu 

Tissue Procurement: Dr. M. J. Welsh, University of Iowa 
cDNA Library preparation: Dr. M. Bento Scares, University of Iowa 
cDNA Library Arrayed by: Dr. M. Bento Scares, University of Iowa 
DNA Sequencing by: Dr. M. Bento Scares, University of Iowa 
Clone Distribution: Researchers may obtain clones from Research 

Genetics (www.resgen.com) or from Open Biosystems 

(www.openbiosystems.com) . 

Seq primer: M13 FORWARD 

POLYA=Yes . 

FEATURES Location/Qualifiers 
source 1. .687 

/organism="Homo sapiens" 
/mol_type=" mRNA " 
/db_xref="taxon: 96 06" 
/clone="UI-CF-ENl-aee-b-14-0-UI" 

/tissue_type="Primary Lung Cystic Fibrosis Epithelial 
Cells" 

/dev_stage= "Adult " 

/lab_host="DH10B (Life Technologies) (Tl phage resistant)" 
/clone_lib="UI-CF-ENl" 

/note="Organ : Lung; Vector: pT7T3-Pac (Pharmacia) with a 
modified polylinker; Site_l : EcoR I; Site_2 : Not I; 
UI-CF-ENl is a normalized cDNA library containing the 
following tissue(s): Primary Lung Cystic Fibrosis 
Epithelial Cells. The library was constructed according to 
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Bonaldo, Lennon and Scares, Genome Research, 6:791-806, 
1996. First strand cDNA synthesis was primed with an 
oligo-dT primer containing a Not I site. Double stranded 
cDNA was ligated to an EcoR I adaptor, digested with Not 
I, and cloned directionally into pT7T3-Pac vector. The 
oligonucleotide used to prime the synthesis of 
first-strand cDNA contains a library tag sequence that is 
located between the Not I site and the (dT)18 tail. The 
sequence tag for this library is CTGCTCAGGT . 
TAG_TISSUE=Human Lung Epithelial Cell Lines untreated LPS 
6hr to LPS 24h 
TAG_LIB=UI-CF-EN1 
TAG_SEQ=CTGCTCAGGT " 



Query Match 62.3%; Score 549; DB 4; Length 687; 

Best Local Similarity 100.0%; Pred. No. 9.6e-124; 

Matches 549; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 333 AGTCTGGCTGGATGAGACAGGGTCGTGCCCAGATGATGGAGAAATCGACCCAGAAGCCTG 3 92 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 575 AGTCTGGCTGGATGAGACAGGGTCGTGCCCAGATGATGGAGAAATCGACCCAGAAGCCTG 516 

Qy 393 AGGAGGTGTCCTGGGTTTGGCTGGCTGGCTCCTGCTCCAGCGGCCCGGCTTCAGGTGTCC 452 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 515 AGGAGGTGTCCTGGGTTTGGCTGGCTGGCTCCTGCTCCAGCGGCCCGGCTTCAGGTGTCC 456 

Qy 453 GGGGGCGTGGCTGCCTGGAGCAGGTGTGCTGAATACCCTGGATGGGAACTGAGCGAACCC 512 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 455 GGGGGCGTGGCTGCCTGGAGCAGGTGTGCTGAATACCCTGGATGGGAACTGAGCGAACCC 3 96 

Qy 513 GGGCCTCCGCTCAGAGAGACGTGGCAGGACCAGCGAGGAATCCAGCCTGTCCACTTCCAG 5 72 

M I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 395 GGGCCTCCGCTCAGAGAGACGTGGCAGGACCAGCGAGGAATCCAGCCTGTCCACTTCCAG 336 

Qy 5 73 AACAGTGTTTCCCAGGCCCCGCTGAGTGGACCGGACCTCTGACACCTCCAGGTTCTTGCT 632 

I I I I I I I I I I I I I I I I I I I M M M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 335 AACAGTGTTTCCCAGGCCCCGCTGAGTGGACCGGACCTCTGACACCTCCAGGTTCTTGCT 2 76 

Qy 633 GACTCCGGCCTGGTGAAAGGGAGCGCCATGGTCCTGGCTGTTGGGGTCCCAGGGAGAGGC 6 92 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 275 GACTCCGGCCTGGTGAAAGGGAGCGCCATGGTCCTGGCTGTTGGGGTCCCAGGGAGAGGC 216 

Qy 693 TCTCTTCTGGACAAACACACCCTCCCAGCCCCCAGGGCTGTGCAAACACATGCCCCTGCC 752 

I I I I I I M M I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I 
Db 215 TCTCTTCTGGACAAACACACCCTCCCAGCCCCCAGGGCTGTGCAAACACATGCCCCTGCC 156 

Qy 753 ATAAGCACCAACAAGAACTTCTTGCAGGTGGAGTGGCTGTTTTTTATAAGTTGTTTTACA 812 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
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Db 15 5 ATAAGCACCAACAAGAACTTCTTGCAGGTGGAGTGGCTGTTTTTTATAAGTTGTTTTACA 96 



Qy 813 GATACGGAAACAGTCCAAAATGGGATTTATAATTTCTTTTTTGCATTATAAATAAAGATC 8 72 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 95 GATACGGAAACAGTCCAAAATGGGATTTATAATTTCTTTTTTGCATTATAAATAAAGATC 36 

Qy 873 CTCTGTAAC 881 

I I I I I I I I I 
Db 35 CTCTGTAAC 2 7 



RESULT 7 

BQ045152/C 

LOCUS 

DEFINITION 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
PUBMED 
COMMENT 



FEATURES 

source 



BQ045152 



6 77 bp 



mRNA 



linear EST 21-FEB-2003 



UI-CF-ENl-ael-o-16-O-UI.sl UI-CF-ENl Homo sapiens cDNA clone 

UI-CF-ENl-ael-o-16-O-UI 3', mRNA sequence. 

BQ045152 

BQ045152.1 GI:19796231 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Euarchontoglires ; Primates; Haplorrhini; 
Catarrhini; Hominidae; Homo. 
1 (bases 1 to 677) 

Bonaldo, M. F . , Lennon,G. and Scares, M.B. 

Normalization and subtraction: two approaches to facilitate gene 
discovery 

Genome Res. 6 (9), 791-806 (1996) 
8889548 

Contact: McCray, PB 
McCray Lab 
University of Iowa 

2024 University of Iowa Med Labs, Iowa City, lA 52242, USA 

Tel: 319 356 4866 

Fax: 319 356 7171 

Email: paul-mccray@uiowa.edu 

Tissue Procurement: Dr. M. J. Welsh, University of Iowa 
cDNA Library preparation: Dr. M. Bento Scares, University of Iowa 
cDNA Library Arrayed by: Dr. M. Bento Scares, University of Iowa 
DNA Sequencing by: Dr. M. Bento Soares, University of Iowa 
Clone Distribution: Researchers may obtain clones from Research 
Genetics (www.resgen.com) or from Open Biosystems 
(www.openbiosystems.com) . 
Seq primer: M13 FORWARD 
POLYA=Yes . 

Location/Qualifiers 
1. .677 

/organism="Homo sapiens" 
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/mol_type="mRNA" 
/db_xref="taxon: 96 06" 
/clone="UI-CF-ENl-ael-o-16-0-UI" 

/tissue_type="Primary Lung Cystic Fibrosis Epithelial 

Cells" 

/dev_stage= "Adult " 

/lab_host="DH10B (Life Technologies) (Tl phage resistant)" 
/clone_lib="UI-CF-ENl" 

/note="Organ : Lung; Vector: pT7T3-Pac (Pharmacia) with a 
modified polylinker; Site_l : EcoR I; Site_2 : Not I; 
UI-CF-ENl is a normalized cDNA library containing the 
following tissue(s): Primary Lung Cystic Fibrosis 
Epithelial Cells. The library was constructed according to 
Bonaldo, Lennon and Scares, Genome Research, 6:791-806, 
1996. First strand cDNA synthesis was primed with an 
oligo-dT primer containing a Not I site. Double stranded 
cDNA was ligated to an EcoR I adaptor, digested with Not 
I, and cloned directionally into pT7T3-Pac vector. The 
oligonucleotide used to prime the synthesis of 
first-strand cDNA contains a library tag sequence that is 
located between the Not I site and the (dT)18 tail. The 
sequence tag for this library is CTGCTCAGGT . 
TAG_TISSUE=Human Lung Epithelial Cell Lines untreated LPS 
6hr to LPS 24h 
TAG_LIB=UI-CF-EN1 
TAG_SEQ=CTGCTCAGGT " 



Query Match 62.1%; 
Best Local Similarity 99.8%; 
Matches 548; Conservative 



Score 54 7.4; DB 4; 
Pred. No. 2.4e-123; 

0; Mismatches 1; 



Length 6 77; 



Indels 



Gaps 



Qy 



333 AGTCTGGCTGGATGAGACAGGGTCGTGCCCAGATGATGGAGAAATCGACCCAGAAGCCTG 3 92 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
56 4 AGTCTGGCTGGATGAGACAGGGTCGTGCCCAGATGATGGAGAAATCGACCCAGAAGCCTG 5 05 



Qy 
Db 
Qy 

Db 
Qy 
Db 
Qy 



393 AGGAGGTGTCCTGGGTTTGGCTGGCTGGCTCCTGCTCCAGCGGCCCGGCTTCAGGTGTCC 45 2 

M M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I 

504 AGGAGGTGTCCTGGGTTTGGCTGGCTGGCTCCTGCTCCAGCGGCCCGGCTTCAGGTGTCC 445 

453 GGGGGCGTGGCTGCCTGGAGCAGGTGTGCTGAATACCCTGGATGGGAACTGAGCGAACCC 512 

I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I 

44 4 GGGGGCGTGGCTGCCTGGAGCAGGTGTGCTG/^TACCCTGGATGGGAACTGAGCGAACCC 3 85 

513 GGGCCTCCGCTCAGAGAGACGTGGCAGGACCAGCGAGGAATCCAGCCTGTCCACTTCCAG 5 72 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I 

384 GGGCCTCCGCTCAGAGAGACGTGGCAGGACCAGCGAGGAATCCAGCCTGTCCACTTCCAG 325 

573 AACAGTGTTTCCCAGGCCCCGCTGAGTGGACCGGACCTCTGACACCTCCAGGTTCTTGCT 632 
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I I I I I I M M I I I I I I I I I I I I I I I I I I I I I I I I I M M I I I I I I I I I I I I I I I I I I I I I 
Db 324 AACAGTGTTTCCCAGGCCCCGCTGAGTGGACCGGACCTCTGACACCTCCAGGTTCTTGCT 265 

Qy 633 GACTCCGGCCTGGTGAAAGGGAGCGCCATGGTCCTGGCTGTTGGGGTCCCAGGGAGAGGC 6 92 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 26 4 GACTCCGGCCTGGTGAAAGGGAGCGCCATGGTCCTGGCTGTTGGGGTCCCAGGGAGAGGC 2 05 

Qy 693 TCTCTTCTGGACAAACACACCCTCCCAGCCCCCAGGGCTGTGCAAACACATGCCCCTGCC 752 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 204 TCTCTTTTGGACAAACACACCCTCCCAGCCCCCAGGGCTGTGCAAACACATGCCCCTGCC 145 

Qy 753 ATAAGCACCAACAAGAACTTCTTGCAGGTGGAGTGGCTGTTTTTTATAAGTTGTTTTACA 812 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 144 ATAAGCACCAACAAGAACTTCTTGCAGGTGGAGTGGCTGTTTTTTATAAGTTGTTTTACA 85 

Qy 813 GATACGGAAACAGTCCAAAATGGGATTTATAATTTCTTTTTTGCATTATAAATAAAGATC 8 72 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I 
Db 84 GATACGGAAACAGTCCAAAATGGGATTTATAATTTCTTTTTTGCATTATAAATAAAGATC 25 

Qy 873 CTCTGTAAC 881 

I I I I I I I I I 
Db 24 CTCTGTAAC 16 



RESULT 8 
BI546970 
LOCUS 

DEFINITION 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



BI546970 840 bp mRNA linear EST 05-SEP-2001 

603190185F1 NIH_MGC_95 Homo sapiens cDNA clone IMAGE : 5261660 5', 
mRNA sequence . 
BI546970 

BI546970.1 GI:15434282 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

Mammalia; Eutheria; Euarchontoglires ; Primates; Haplorrhini; 

Catarrhini; Hominidae; Homo. 

1 (bases 1 to 840) 

NIH-MGC http://mgc.nci.nih.gov/. 

National Institutes of Health, Mammalian Gene Collection (MGC) 
Unpublished (1999) 

Contact: Robert Strausberg, Ph.D. 
Email: cgapbs-r@mail.nih.gov 
Tissue Procurement: Miklos Palkovits, 

cDNA Library Preparation: Michael J. 
Toshiyuki and Piero Carninci (RIKEN) 

cDNA Library Arrayed by: The I.M.A.G. 

DNA Sequencing by: 

Clone distribution: 



M.D., Ph.D. 

Brownstein 



(NEGRI), Shiraki 



.E. Consortium (LLNL) 
Incyte Genomics, Inc. 
MGC clone distribution information can be 
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found through the I.M.A.G.E. Consortium/LLNL at: 
http : //image . llnl . gov 

Plate: LLAM11659 row: i column: 21 
High quality sequence stop: 737. 
FEATURES Location/Qualifiers 
source 1. .840 

/organism="Homo sapiens" 

/mo l_t y p e = " mRNA " 

/db_xref="taxon: 9606" 

/clone=" IMAGE: 5 26166 0" 

/ tissue_type="hippocampus" 

/lab_host="DH10B" 

/clone_lib="NIH_MGC_95" 

/note="Organ : brain; Vector: pBluescriptR (modified 
pBluescript KS+); Site_l : BamHI; Site_2 : Sall-Xhol 
(gtcgag) ; Oligo-dT primed using primer 
5>_XTTTTTTTTTTTTTTTVN-3', size-selected for average 
insert size 2.5 kb and normalized to ROT 5. This is a 
primary library enriched for full-length clones and 
constructed using the Cap-trapper method (Carninci, in 
preparation) . Library constructed by M. Brownstein 
(NIMH/NHGRI, National Institutes of Health) . Note: this 
is a NIH_MGC Library." 



Query Match 61.9%; Score 545.6; DB 4; Length 8 40; 

Best Local Similarity 89.1%; Pred. No. 6.8e-123; 

Matches 70 9; Conservative 0; Mismatches 34; Indels 53; Gaps 9; 

Qy 1 GGGCCATGACCCCCGCTGCTCTGTCTTGCAGGCTCGTCGCCGCGGCCCCCCGAGCCCGAC 6 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 49 GGGCCATGACCCCCGCTGCTCTGTCTTGCAGGCTCGTCGCCGCGGCCCCCCGAGCCCGAC 108 



Qy 61 CGCCGCCGCCACCACCACCAGCGCCCGGGCGGGCCTCGCGCGCCTCGGGCGCGGCTCCGC 12 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 10 9 CGCCGCCGCCACCACCACCAGCGCCCGGGCGGGCCTCGCGCGCCTCGGGCGCGGCTCCGC 16 8 



Qy 121 AGTGAGCCCACCAAGAAGGAAGCGGCCTGCAGAGGTGCCGACATGGGGCTTAAGATGTCC 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 16 9 AGTGAGCCCACCAAGAAGGAAGCGGCCTGCAGAGGTGCCGACATGGGGCTTAAGATGTCC 22 8 



Qy 181 TGCCTGAAAG GCTTTCAA 198 

I I I I I I I I I I I I I I 

Db 229 TGCCTGAAAGACGCCGGTTTTCATCTGTGATGCGGGGACAGCTGCGCTCCTTGCTGCGAG 288 



Qy 199 ATGTGTGTCAGCAGCAGCAGCAGCAGCCACGACGAGGCCCCCGTCCTGAACGACAAGCAC 258 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 289 GCGTCAGGACCCAGCAGCAGCAGCAGCCACGACGAGGCCCCCGTCCTGAACGACAAGCAC 348 
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Qy 25 9 CTGGACGTGCCCGACATCATCATCACGCCCCCCACCCCCACGGGCATGATGCTGCCGAGG 318 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I 
Db 349 CTGGACGTGCCCGACATCATCATCACGCCCCCCA-CCCCACGGGCATGATGCTGCCGAGG 40 7 

Qy 319 GACTTGGGGAGCACAGTCTGGCTGG-ATGAGACAGGGTCGTGCCCAGATGATGGAGAAAT 3 77 

I I I M M M I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I 
Db 408 GACTTGGGGAGCACAGTCTGGCTGGCATGAGACAGGGTCGTGCCCAGATGATGGAGAAAT 46 7 

Qy 3 78 CGACCCAGAAGCCTGAGGAGGTGTCCTGGGTTTGGCTGGCTGGCTCCTGCTCCAGCGGCC 43 7 

I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 46 8 CGACCCAGAAGCCTGAGGAGGTGTCCTGGGTTTGGCTGGCTGGCTCCTGCTCCAGCGGCC 52 7 

Qy 438 CGGCTTCAGGTGTCCGGGGGCGTGGCTGCCTGGAGCAGGTGTGCTGAATACCCTGGATGG 49 7 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 528 CGGCTTCAGGTGTCCGGGGGCGTGGCTGCCTGGAGCAGGTGTGCTGAATACCCTGGATGG 587 

Qy 498 GAAC-TGAGCGAACCCGGGCCTCCGCTCAGAGAGACGTGGCAGGACCAGCGAGGAATCCA 556 

I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 58 8 GAACTTGAGCGAACCCGGGCCTCCGCTCAGAGAGACGTGGCAGGACCAGCGAGGAATCCA 6 47 

Qy 55 7 GCCTGTCCACTTCCAGAA-CAGTGTTTCCCAGGCCCCGCTG-AGTGGACCGGACCTC-TG 613 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I II 
Db 6 48 GCCTGTCCACTTCCAGAACCAGTGTTTCACAGGCCCCGCTGAAGTGGACCGGACCTCTTG 70 7 

Qy 614 ACACCTCCAGGTTCTTGCTGACTCCGGCCTGGTGAAAGGGAGCGCCATGGTCCTGGCTGT 6 73 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 708 ACACCTCCAGGTCCTTGCTGACTCCGGCCTGTTGAAAGGGAACGCCATGGTCCTGTGCTG 76 7 

Qy 6 74 TGG — GGTCCCAGGGAGAGGCTCTCTTCTGGACAAACACACCCTCCCAGCCCCCAGGGCT 731 

III I I I I I I I I I I I I I I I I I I I I I I I I I I I III I I I I I I I I I I 
Db 76 8 TGGGAGTCCCAAGGAAGAGGCTCTCTTCTGGACAAAACAACCTCCAAA CCCAAGGCT 82 4 

Qy 732 GTGCAAACACATGCCC 74 7 

I I I I I I I I I I I I I I I 
Db 825 GTGCCAACACATGCCC 840 



RESULT 9 
CF993841 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



CF993841 838 bp rtiRNA linear EST 25-NOV-20C 

AGENCOURT_15622657 NIH_MGC_147 Homo sapiens cDNA clone 
IMAGE: 30705878 5', mRNA sequence. 

CF993841 

CF993841.1 GI:38509901 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Euarchontoglires ; Primates; Haplorrhini; 
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REFERENCE 
AUTHORS 



JOURNAL 
COMMENT 



TITLE 



Catarrhini; Hominidae; Homo. 

1 (bases 1 to 838) 

NIH-MGC http://mgc.nci.nih.gov/. 

National Institutes of Health, Mammalian Gene Collection (MGC) 

Unpublished (1999) 

Contact: Daniela S. Gerhard, Ph.D. 



Office of Cancer Genomics 

National Cancer Institute / NIH 

Bldg. 31 RmlOAOV Bethesda, MD 20892 

Email: cgapbs-r@mail.nih.gov 

Tissue Procurement: Dr. Stefan Hansson 
cDNA Library Preparation: Michael J. Brownstein (NHGRI) with help 

and advice from Piero Carninci (RIKEN) 
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 
DNA Sequencing by: Agencourt Bioscience Corporation 
Clone distribution: MGC clone distribution information can be 

found through the I.M.A.G.E. Consortium/LLNL at: 

http : //image . llnl . gov 

Plate: NDAM106 8 row: d column: 15 

High quality sequence stop: 558. 



/organism="Homo sapiens" 
/mol_type=" mRNA " 
/db_xref="taxon: 96 06" 
/clone=" IMAGE: 307058 78" 
/tissue_type="Human Placenta" 
/lab_host="DH10B TonA" 
/clone_lib="NIH_MGC_147" 

/note="Organ : placenta; Vector: pBluescriptR; Site_l : 
all-XhoI; Site_2: BamH; Oligo-dT primed using primer 
5 . _XTTTTTTTTTTTTTTTVN-3 ' , size-selected for average 
insert size 2.3 kb and normalized to ROT 5. This is a 
primary library enriched for full-length clones and 
constructed using the Cap-trapper method (Carninci, in 
preparation) . Library constructed by M. Brownstein 
(NIMH/NHGRI, National Institutes of Health). Note: This is 
a NIH_MGC library." 



FEATURES 



source 



Location/Qualifiers 
1. .838 



ORIGIN 



Query Match 
Best Local Similarity 
Matches 549; Conser^ 



Conservative 



61.9%; Score 5 45.4; DB 7 
98.9%; Pred. No. 7.6e-123 

,ive 0; Mismatches 6 



Length 83 8; 



Indels 



0; Gaps 



0; 



Qy 



32 7 GAGCACAGTCTGGCTGGATGAGACAGGGTCGTGCCCAGATGATGGAGAAATCGACCCAGA 3 86 




Db 



16 GCGTAGAACCAGGCTGGATGAGACAGGGTCGTGCCCAGATGATGGAGAAATCGACCCAGA 75 



Qy 



38 7 AGCCTGAGGAGGTGTCCTGGGTTTGGCTGGCTGGCTCCTGCTCCAGCGGCCCGGCTTCAG 4 46 
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I I I I I M M M I I I I I I I I I I I I I I I I I I I I I I I I M M I I I I I I I I I I I I I I I I I I I I I 
Db 76 AGCCTGAGGAGGTGTCCTGGGTTTGGCTGGCTGGCTCCTGCTCCAGCGGCCCGGCTTCAG 135 

Qy 44 7 GTGTCCGGGGGCGTGGCTGCCTGGAGCAGGTGTGCTGAATACCCTGGATGGGAACTGAGC 5 06 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 136 GTGTCCGGGGGCGTGGCTGCCTGGAGCAGGTGTGCTGAATACCCTGGATGGGAACTGAGC 195 

Qy 50 7 GAACCCGGGCCTCCGCTCAGAGAGACGTGGCAGGACCAGCGAGGAATCCAGCCTGTCCAC 566 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 196 GAACCCGGGCCTCCGCTCAGAGAGACGTGGCAGGACCAGCGAGGAATCCAGCCTGTCCAC 255 

Qy 56 7 TTCCAGAACAGTGTTTCCCAGGCCCCGCTGAGTGGACCGGACCTCTGACACCTCCAGGTT 626 

M M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 256 TTCCAGAACAGTGTTTCCCAGGCCCCGCTGAGTGGACCGGACCTCTGACACCTCCAGGTT 315 

Qy 62 7 CTTGCTGACTCCGGCCTGGTGAAAGGGAGCGCCATGGTCCTGGCTGTTGGGGTCCCAGGG 6 86 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 316 CTTGCTGACTCCGGCCTGGTGAAAGGGAGCGCCATGGTCCTGGCTGTTGGGGTCCCAGGG 3 75 

Qy 68 7 AGAGGCTCTCTTCTGGACAAACACACCCTCCCAGCCCCCAGGGCTGTGCAAACACATGCC 7 46 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 376 AGAGGCTCTCTTCTGGACAAACACACCCTCCCAGCCCCCAGGGCTGTGCAAACACATGCC 435 

Qy 74 7 CCTGCCATAAGCACCAACAAGAACTTCTTGCAGGTGGAGTGGCTGTTTTTTATAAGTTGT 8 06 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 436 CCTGCCATAAGCACCAACAAGAACTTCTTGCAGGTGGAGTGGCTGTTTTTTATAAGTTGT 495 

Qy 80 7 TTTACAGATACGGAAACAGTCCAAAATGGGATTTATAATTTCTTTTTTGCATTATAAATA 86 6 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 496 TTTACAGATACGGAAACAGTCCAAAATGGGATTTATAATTTCTTTTTTGCATTATAAATA 555 

Qy 867 AAGATCCTCTGTAAC 881 

M M I M I I I I I I I I 
Db 556 AAGATCCTCTGTAAC 5 70 



RESULT 10 

AA760709/C 

LOCUS 

DEFINITION 



AA760709 600 bp rtiRNA linear EST 07-FEB-1998 

nzl3cll.sl NCI_CGAP_GCB1 Homo sapiens cDNA clone IMAGE : 1287668 3', 

mRNA sequence . 
AA760709 

AA760709.1 GI:2809639 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Euarchontoglires ; Primates; Haplorrhini; 
Catarrhini; Hominidae; Homo. 
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REFERENCE 1 (bases 1 to 600) 

AUTHORS NCI-CGAP http://www.ncbi.nlm.nih.gov/ncicgap. 

TITLE National Cancer Institute, Cancer Genome Anatomy Project (CGAP), 

Tumor Gene Index 
JOURNAL Unpublished (1997) 
COMMENT Contact: Robert Strausberg, Ph.D. 

Email: cgapbs-r@mail.nih.gov 

Tissue Procurement: Louis M. Staudt, M.D., Ph.D., David Allman, 
Ph.D., Gerald Marti, M.D. 

cDNA Library Preparation: M. Bento Soares, Ph.D., M. Fatima 
Bonaldo, Ph.D. 

cDNA Library Arrayed by: Greg Lennon, Ph.D. 

DNA Sequencing by: Washington University Genome Sequencing Center 
Clone distribution: NCI-CGAP clone distribution information can b 
found through the I.M.A.G.E. Consortium/LLNL at: 
www-bio . llnl . gov/bbrp/image/image . html 
Insert Length: 773 Std Error: 0.00 
Seq primer: -40ml3 fwd. ET from Amersham 
High quality sequence stop: 507. 
FEATURES Location/Qualifiers 
source 1. .600 

/organism="Homo sapiens" 

/mol_type=" mRNA " 

/db_xref="taxon: 96 06" 

/clone="IMAGE: 1287668" 

/tissue_type="germinal center B cell" 

/lab_host="DH10B" 

/ c 1 one_l ib= " NC I_CGAP_GCB 1 " 

/note="Vector : pT7T3D-PacI; Site_l : Not I; Site_2 : Eco RI 
1st strand cDNA was prepared from human tonsillar cells 
enriched for germinal center B cells by flow sorting 
(CD20+, IgD-), provided by Dr. Louis M. Staudt (NCI), Dr. 
David Allman (NCI) and Dr. Gerald Marti (CBER) . cDNA 
synthesis was primed with a Not I - oligo(dT) primer 
[ 5 ' -TGTTACCAATCTGAAGTGGGAGCGGCCGCCTCATTTTTTTTTTTTTTTTTT-3 
] . Double-stranded cDNA was ligated to Eco RI adaptors 
(Pharmacia) , digested with Not I and cloned into the Not 
and Eco RI sites of the modified pT7T3 vector. Library 
went through one round of normalization, and was 
constructed by Bento Soares and M. Fatima Bonaldo." 



Query Match 61.7%; Score 5 43.8; DB 1; Length 6 00; 

Best Local Similarity 98.8%; Pred. No. 1.8e-122; 

Matches 578; Conservative 0; Mismatches 4; Indels 3; Gaps 3 

Qy 298 ACGGGCATGATGCTGCCGAGGGACTTGGGGAGCACAGTCTGGCTGGATGAGACAGGGTCG 35 7 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 600 ACGGGCATGATGCTGCCGAGGGACTT-GGGAGCACAGTCTGGCTGGATGAGACAGGGTCG 5 42 
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Qy 358 TGCCCAGATGATGGAGAAATCGACCCAGAAGCCTGAGGAGGTGTCCTGGGTTTGGCTGGC 417 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 541 TGCCCAGATGATGGAGAAATCGACCCAGAAGCCTGAGGAGGTGTCCTGGGTTTGGCTGGC 482 

Qy 418 TGGCTCCTGCTCCAGCGGCCCGGCTTCAGGTGTCCGGGGGCGTGGCTGCCTGGAGCAGGT 477 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 481 TGGCTCCTGCTCCAGCGGCCCGGCTTCAGGTGTCCGGGGGCGTGGCTGCCTGGAGCAGGT 422 

Qy 478 GTGCTGAATACCCTGGATGGGAACTGAGCGAACCCGGGCCTCCGCTCAGAGAGACGTGGC 53 7 

I M I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 421 GTGCTGAATACCCTGGATGGGAACTGAGCGAACCCGGGCCTCCGCTCAGAGAGACGTGGC 36 2 

Qy 53 8 AGGACCAGCGAGGAATCCAGCCTGTCCACTTCCAGAACAGTGTTTCCCAGGCCCCGCTGA 59 7 

I I I I I I I I I I I I I I I I I I I I I M I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 361 AGGACCAGCGAGGAATCCAGCCTGTCCACTTCCAGAACAGTGTTTCCCAGGCCCCGCTGA 3 02 

Qy 598 GTGGACCGGACCTCTGACACCTCCAGGTTCTTGCTGACTCCGGCCTGGTGAAAGGGAGCG 657 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 GTGGACCGGACCTCTGACACCTCCAGGTTCTTGCTGACTCCGGCCTGGTGAAAGGGAGCG 242 

Qy 658 CCATGGTCCTGGCTGTTGGGGTCCCAGGGAGAGGCTCTCTTCTGGACAAACACACCCTCC 717 

I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 CCATGGTCCTGGCTGTTGGGGTCCCAGGGAGAGGCTCTCTTCTGGACAAACACACCCTNN 182 

Qy 718 CAGCCCCCAGGGCTGTGCAAACACATGCCCCTGCCATAAGCACCAACAAGAACTTCTTGC 777 

I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 C-CCCCCCAGGGCTGTGCAAACACATGCCCCTCCCATAAGCACCAACAAGAACTTCTTGC 123 

Qy 778 A-GGTGGAGTGGCTGTTTTTTATAAGTTGTTTTACAGATACGGAAACAGTCCAAAATGGG 836 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 122 AGGGTGGAGTGGCTGTTTTTTATAAGTTGTTTTACAGATACGGAAACAGTCCAAAATGGG 63 

Qy 83 7 ATTTATAATTTCTTTTTTGCATTATAAATAAAGATCCTCTGTAAC 881 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 62 ATTTATAATTTCTTTTTTGCATTATAAATAAAGATCCTCTGTAAC 18 



BQ025351 
UI-l-BBlp-aud- 
UI-l-BBlp-aud- 
BQ025351 
BQ025351.1 GI:19760630 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata 



5 73 bp mRNA linear EST 2 7-MAR-2002 
-02-0-UI.sl NCI_CGAP_P16 Homo sapiens cDNA clone 
-02-0-UI 3', mRNA sequence. 



Craniata; Vertebrata; Euteleostomi ; 
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Mammalia; Eutheria; Euarchontoglires ; Primates; Haplorrhini; 
Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 5 73) 

AUTHORS NCI-CGAP http://www.ncbi.nlm.nih.gov/ncicgap. 

TITLE National Cancer Institute, Cancer Genome Anatomy Project (CGAP) , 

Tumor Gene Index 
JOURNAL Unpublished (1997) 
COMMENT Contact: Robert Strausberg, Ph.D. 

Email: cgapbs-r@mail.nih.gov 
Tissue Procurement: Dr. Steven Brown 
cDNA Library preparation: Dr. M. Bento Scares, University of Iowa 
cDNA Library Arrayed by: Dr. M. Bento Scares, University of Iowa 
DNA Sequencing by: Dr. M. Bento Scares, University of Iowa 
Clone Distribution: Clone distribution information can be obtained 
from Dr. M. Bento Soares, bento-soares@uiowa.edu 
Seq primer: M13 FORWARD 
POLYA=Yes . 

FEATURES Location/Qualifiers 
source 1. .573 

/organism="Homo sapiens" 
/mo l_t y pe= " mRNA " 
/db_xref="taxon: 96 06" 
/clone="UI-l-BBlp-aud-e-02-0-UI" 
/ t is sue_type=" Placenta" 
/dev_stage="Full Term" 
/lab_host="DH10B (Life Technologies)" 
/ c 1 one_l ib= " NC I_CGAP_P 1 6 " 

/note="Organ : Placenta; Vector: pT7T3-Pac (Pharmacia) with 
a modified polylinker; Site_l : EcoR I; Site_2 : Not I; 
NCI_CGAP_P16 is a subtracted cDNA library constructed 
according to Bonaldo, Lennon and Soares, Genome Research, 
6:791-806, 1996. First strand cDNA synthesis was primed 
with an oligo-dT primer containing a Not I site. Double 
stranded cDNA was ligated to an EcoR I adaptor, digested 
with Not I, and cloned directionally into pT7T3-Pac 
vector. The oligonucleotide used to prime the synthesis of 
first-strand cDNA contains a library tag sequence that is 
located between the Not I site and the (dT)18 tail. The 
sequence tags for this library are GA, AGGAA. For 
additional information, contact: Bento Soares, 
bent o-s oar es@uiowa . edu 
TAG_TISSUE=placenta human full term 
TAG_L IB=UI - 1 -BB Ip 
TAG_SEQ=AGGAA" 

ORIGIN 

Query Match 61.3%; Score 539.8; DB 4; Length 5 73; 

Best Local Similarity 99.5%; Pred. No. 1.7e-121; 

Matches 552; Conservative 0; Mismatches 2; Indels 1; Gaps 1; 



http://es/ScoreAccessWeb/GetItem.action?AppId=105295...7_122917_us-10-529-592a-l.rst&ItemType=4&startByte=0 (25 of 35)5/19/2009 9:51:32 AM 



SCORE Search Results Details for Application 10529592 and Search Result 20090427_122917_us-10-529-592a-l.rst 



Qy 32 7 GAGCACAGTCTGGCTGGATGAGACAGGGTCGTGCCCAGATGATGGAGAAATCGACCCAGA 386 

III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 

Db 56 9 GAGGACAGTCTGGCTGGATGAGACAGGGTCGTGCCCAGATGATGGAGAAATCGACCCAGA 510 

Qy 38 7 AGCCTGAGGAGGTGTCCTGGGTTTGGCTGGCTGGCTCCTGCTCCAGCGGCCCGGCTTCAG 446 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 509 AGCCTGAGGAGGTGTCCTGGGTTTGGCTGGCTGGCTCCTGCTCCAGCGGCCCGGCTTCAG 45 0 

Qy 44 7 GTGTCCGGGGGCGTGGCTGCCTGGAGCAGGTGTGCTGAATACCCTGGATGGGAACTGAGC 5 06 

M II II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II II II II II II II 
Db 449 TTGTCCGGGGGCGTGGCTGCCTGGAGCAGGTGTGCTGAATACCCTGGATGGGAACTGAGC 39 0 

Qy 50 7 GAACCCGGGCCTCCGCTCAGAGAGACGTGGCAGGACCAGCGAGGAATCCAGCCTGTCCAC 56 6 

I I I I I I I I I I I I I I I I I I M M M M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 389 GAACCCGGGCCTCCGCTCAGAGAGACGTGGCAGGACCAGCGAGGAATCCAGCCTGTCCAC 33 0 

Qy 56 7 TTCCAGAACAGTGTTTCCCAGGCCCCGCTGAGTGGACCGGACCTCTGACACCTCCAGGTT 6 26 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 329 TTCCAGAACAGTGTTTCCCAGGCCCCGCTGAGTGGACCGGACCTCTGACACCTCCAGGTT 2 70 

Qy 627 CTTGCTGACTCCGGCCTGGTGAAAGGGAGCGCCATGGTCCTGGCTGTTGGGGTCCCAGGG 686 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 26 9 CTTGCTGACTCCGGCCTGGTGAAAGGGAGCGCCATGGTCCTGGCTGTTGGGGTCCCAGGG 210 

Qy 68 7 AGAGGCTCTCTTCTGGACAAACACACCCTCCCAGCCCCCAGGGCTGTGCAAACACATGCC 7 46 

I I I I I I I I I I I I I M M M I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 20 9 AGAGGCTCTCTTCTGGACAAACACACCCTCCCAGCCCCCAGGGCTGTGCAAACACATGCC 15 0 

Qy 74 7 CCTGCCATAAGCACCAACAAGAACTTCTTGCAGGTGGAGTGGCTGTTTTTTATAAGTTGT 806 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 149 CCTGCCATAAGCACCAACAAGAACTTCTTGCAGGTGGAGTGGCTGTTTTTTATAAGTTGT 9 0 

Qy 80 7 TTTACAGATACGGAAACAGTCCAAAATGGGATTTATAATTTCTTTTTTGCATTATAAATA 866 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 8 9 TTTACAGATACGGAAACAGTCCAAAATGGGATTTATAA-TTCTTTTTTGCATTATAAATA 31 

Qy 86 7 AAGATCCTCTGTAAC 881 

I I I I I I I I I I I I I I I 
Db 3 0 AAGATCCTCTGTAAC 16 



RESULT 12 

BQ025360/C 

LOCUS 

DEFINITION 



BQ025360 
UI-l-BBlp-aud-f 
UI-l-BBlp-aud-f 
BQ025360 

BQ025360.1 GI:19760639 



581 bp mRNA linear EST 27-MAR-2002 
02-0-UI.sl NCI_CGAP_P16 Homo sapiens cDNA clone 
02-0-UI 3', mRNA sequence. 
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KEYWORDS EST. 

SOURCE Homo sapiens (human) 

ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Euarchontoglires ; Primates; Haplorrhini; 
Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 581) 

AUTHORS NCI-CGAP http://www.ncbi.nlm.nih.gov/ncicgap. 

TITLE National Cancer Institute, Cancer Genome Anatomy Project (CGAP), 

Tumor Gene Index 
JOURNAL Unpublished (1997) 
COMMENT Contact: Robert Strausberg, Ph.D. 

Email: cgapbs-r@mail.nih.gov 
Tissue Procurement: Dr. Steven Brown 
cDNA Library preparation: Dr. M. Bento Scares, University of Iowa 
cDNA Library Arrayed by: Dr. M. Bento Scares, University of Iowa 
DNA Sequencing by: Dr. M. Bento Scares, University of Iowa 
Clone Distribution: Clone distribution information can be obtained 
from Dr. M. Bento Scares, bento-soares@uiowa.edu 
Seq primer: M13 FORWARD 
POLYA=Yes . 

FEATURES Location/Qualifiers 
source 1. .581 

/organism="Homo sapiens" 
/mol_type=" mRNA " 
/db_xref="taxon: 96 06" 
/clone="UI-l-BBlp-aud-f-02-0-UI" 
/ t is sue_type=" Placenta" 
/dev_stage="Full Term" 
/lab_host="DH10B (Life Technologies)" 
/clone_lib="NCI_C GAP_P 1 6 " 

/note="Organ : Placenta; Vector: pT7T3-Pac (Pharmacia) with 
a modified polylinker; Site_l : EcoR I; Site_2 : Not I; 
NCI_CGAP_P16 is a subtracted cDNA library constructed 
according to Bonaldo, Lennon and Scares, Genome Research, 
6:791-806, 1996. First strand cDNA synthesis was primed 
with an oligo-dT primer containing a Not I site. Double 
stranded cDNA was ligated to an EcoR I adaptor, digested 
with Not I, and cloned directionally into pT7T3-Pac 
vector. The oligonucleotide used to prime the synthesis of 
first-strand cDNA contains a library tag sequence that is 
located between the Not I site and the (dT)18 tail. The 
sequence tags for this library are GA, AGGAA. For 
additional information, contact: Bento Scares, 
bento-soares@uiowa . edu 
TAG_TISSUE=placenta human full term 
TAG_L IB=UI - 1 -BB Ip 
TAG_SEQ=AGGAA" 
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Query Match 61.3%; Score 539.8; DB 4; Length 581; 

Best Local Similarity 99.5%; Pred. No. 1.7e-121; 

Matches 552; Conservative 0; Mismatches 2; Indels 1; Gaps 1; 

Qy 32 7 GAGCACAGTCTGGCTGGATGAGACAGGGTCGTGCCCAGATGATGGAGAAATCGACCCAGA 3 86 

III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 56 9 GAGGACAGTCTGGCTGGATGAGACAGGGTCGTGCCCAGATGATGGAGAAATCGACCCAGA 510 

Qy 387 AGCCTGAGGAGGTGTCCTGGGTTTGGCTGGCTGGCTCCTGCTCCAGCGGCCCGGCTTCAG 446 

I M II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II II I I I I II II II 
Db 509 AGCCTGAGGAGGTGTCCTGGGTTTGGCTGGCTGGCTCCTGCTCCAGCGGCCCGGCTTCAG 45 0 

Qy 44 7 GTGTCCGGGGGCGTGGCTGCCTGGAGCAGGTGTGCTGAATACCCTGGATGGGAACTGAGC 5 06 

I I I I I I I I I I I I I I I I I I M M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 449 TTGTCCGGGGGCGTGGCTGCCTGGAGCAGGTGTGCTGAATACCCTGGATGGGAACTGAGC 390 

Qy 50 7 GAACCCGGGCCTCCGCTCAGAGAGACGTGGCAGGACCAGCGAGGAATCCAGCCTGTCCAC 566 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 389 GAACCCGGGCCTCCGCTCAGAGAGACGTGGCAGGACCAGCGAGGAATCCAGCCTGTCCAC 33 0 

Qy 56 7 TTCCAGAACAGTGTTTCCCAGGCCCCGCTGAGTGGACCGGACCTCTGACACCTCCAGGTT 6 26 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 329 TTCCAGAACAGTGTTTCCCAGGCCCCGCTGAGTGGACCGGACCTCTGACACCTCCAGGTT 2 70 

Qy 62 7 CTTGCTGACTCCGGCCTGGTGAAAGGGAGCGCCATGGTCCTGGCTGTTGGGGTCCCAGGG 6 86 

I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 26 9 CTTGCTGACTCCGGCCTGGTGAAAGGGAGCGCCATGGTCCTGGCTGTTGGGGTCCCAGGG 210 

Qy 68 7 AGAGGCTCTCTTCTGGACAAACACACCCTCCCAGCCCCCAGGGCTGTGCAAACACATGCC 7 46 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 209 AGAGGCTCTCTTCTGGACAAACACACCCTCCCAGCCCCCAGGGCTGTGCAAACACATGCC 150 

Qy 74 7 CCTGCCATAAGCACCAACAAGAACTTCTTGCAGGTGGAGTGGCTGTTTTTTATAAGTTGT 8 06 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 149 CCTGCCATAAGCACCAACAAGAACTTCTTGCAGGTGGAGTGGCTGTTTTTTATAAGTTGT 9 0 

Qy 80 7 TTTACAGATACGGAAACAGTCCAAAATGGGATTTATAATTTCTTTTTTGCATTATAAATA 86 6 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 89 TTTACAGATACGGAAACAGTCCAAAATGGGATTTATAA-TTCTTTTTTGCATTATAAATA 31 

Qy 867 AAGATCCTCTGTAAC 881 

II I I I I I I I I I I I I I 

Db 30 AAGATCCTCTGTAAC 16 



RESULT 13 
BM711587 

LOCUS BM711587 591 bp mRNA linear EST 28-FEB-2002 
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DEFINITION UI-E-CLl-afb-g-19-O-UI .rl UI-E-CLl Homo sapiens cDNA clone 

UI-E-CLl-afb-g-19-O-UI 5', rtiRNA sequence. 
ACCESSION BM711587 

VERSION BM711587.1 GI:19024845 

KEYWORDS EST. 

SOURCE Homo sapiens (human) 

ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Euarchontoglires ; Primates; Haplorrhini; 
Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 591) 

AUTHORS Bonaldo,M.F. , Lennon,G. and Soares,M.B. 

TITLE Normalization and subtraction: two approaches to facilitate gene 

discovery 

JOURNAL Genome Res. 6 (9), 791-806 (1996) 
PUBMED 8 8 895 48 
COMMENT Contact: Scares, MB 

Cancer Biology & Epigenomics Program 
Children's Memorial Research Center 

2300 Children's Plaza, Box 220, Chicago, IL 60614-3394, USA 
Tel: 773 755 6551 
Fax: 773 755 6378 

Email : mbsoares@childrensmemorial . org 
Tissue Procurement: Dr. Gregg Hageman 
cDNA Library preparation: Dr. M. Bento Scares, Univeristy of Iowa 
cDNA Library Arrayed by: Dr. M. Bento Scares, Univeristy of Iowa 
DNA Sequencing by: Dr. M. Bento Scares, Univeristy of Iowa 
Clone Distribution: Researchers may obtain clones from Research 
Genetics (www.resgen.com). 
Seq primer: M13 Reverse. 
FEATURES Location/Qualifiers 
source 1. .591 

/organism="Homo sapiens" 
/mol_type=" mRNA " 
/db_xref="taxon: 96 06" 
/clone="UI-E-CLl-afb-g-19-0-UI" 
/tissue_type="human retina" 
/dev_stage=" adult" 

/lab_host="DH10B (Life Technologies) (Tl phage resistant)" 
/clone_lib="UI-E-CLl" 

/note="Organ : eye; Vector: pT7T3-Pac (Pharmacia) with a 
modified polylinker; Site_l : EcoR I; Site_2 : Not I; 
UI-E-CLl is a normalized cDNA library containing the 
following tissue(s): retina. The library was constructed 
according to Bonaldo, Lennon and Scares, Genome Research, 
6:791-806, 1996. First strand cDNA synthesis was primed 
with an oligo-dT primer containing a Not I site. Double 
stranded cDNA was ligated to an EcoR I adaptor, digested 
with Not I, and cloned directionally into pT7T3-Pac 
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vector. The oligonucleotide used to prime the synthesis of 
first-strand cDNA contains a library tag sequence that is 
located between the Not I site and the (dT)18 tail. The 
sequence tag for this library is CCGCG. This library was 
created for the program. Gene Discovery in the Visual 
System, supported by National Eye Institute (NEI)." 



Query Match 60.9%; Score 536.4; DB 4; Length 591; 

Best Local Similarity 96.0%; Pred. No. 1.2e-120; 

Matches 5 49; Conservative 0; Mismatches 23; Indels 0; Gaps 0; 

Qy 30 7 ATGCTGCCGAGGGACTTGGGGAGCACAGTCTGGCTGGATGAGACAGGGTCGTGCCCAGAT 36 6 

III II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 20 ATGGGGCTTAAGATGTCCTGCCTGAAAGTCTGGCTGGATGAGACAGGGTCGTGCCCAGAT 79 

Qy 36 7 GATGGAGAAATCGACCCAGAAGCCTGAGGAGGTGTCCTGGGTTTGGCTGGCTGGCTCCTG 426 

I I I I I I I I I I I I I I I I I I I I I M M I II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 8 0 GATGGAGAAATCGACCCAGAAGCCTGAGGAGGTGTCCTGGGTTTGGCTGGCTGGCTCCTG 13 9 

Qy 42 7 CTCCAGCGGCCCGGCTTCAGGTGTCCGGGGGCGTGGCTGCCTGGAGCAGGTGTGCTGAAT 486 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 140 CTCCAGCGGCCCGGCTTCAGGTGTCCGGGGGCGTGGCTGCCTGGAGCAGGTGTGCTGAAT 199 

Qy 48 7 ACCCTGGATGGGAACTGAGCGAACCCGGGCCTCCGCTCAGAGAGACGTGGCAGGACCAGC 5 46 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 200 ACCCTGGATGGGAACTGAGCGAACCCGGGCCTCCGCTCAGAGAGACGTGGCAGGACCAGC 25 9 

Qy 54 7 GAGGAATCCAGCCTGTCCACTTCCAGAACAGTGTTTCCCAGGCCCCGCTGAGTGGACCGG 6 06 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 26 0 GAGGAATCCAGCCTGTCCACTTCCAGAACAGTGTTTCCCAGGCCCCGCTGAGTGGACCGG 319 

Qy 60 7 ACCTCTGACACCTCCAGGTTCTTGCTGACTCCGGCCTGGTGAAAGGGAGCGCCATGGTCC 666 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 320 ACCTCTGACACCTCCAGGTTCTTGCTGACTCCGGCCTGGTGAAAGGGAGCGCCATGGTCC 3 79 

Qy 66 7 TGGCTGTTGGGGTCCCAGGGAGAGGCTCTCTTCTGGACAAACACACCCTCCCAGCCCCCA 726 

M M M M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 380 TGGCTGTTGGGGTCNCAGGGAGAGGCTCTCTTCTGGACAAACACACCCTCCCAGCCCCCA 439 

Qy 72 7 GGGCTGTGCAAACACATGCCCCTGCCATAAGCACCAACAAGAACTTCTTGCAGGTGGAGT 786 

I I I I I I I I I I I I I I I I I I I I I I I I I I I M II I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 44 0 GGGCTGTGCAAACACATGCCCCTCGCATAAGCACCAACAAGAACTTCTTGCAGGTGGAGT 49 9 

Qy 78 7 GGCTGTTTTTTATAAGTTGTTTTACAGATACGGAAACAGTCCAAAATGGGATTTATAATT 8 46 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 500 GGCTGTTTTTTATAAGTTGTTTTACAGATACGGAAACAGTCCAAAATGGGATTTATAATT 559 

Qy 847 TCTTTTTTGCATTATAAATAAAGATCCTCTGT 878 
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Db 560 TCTTTTTTGCATTTATAATANAGATCCTCTGT 591 



RESULT 14 

AI973037/C 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
COMMENT 



FEATURES 

source 



539 bp mRNA linear EST 08-MAR-20 00 
Homo sapiens cDNA clone IMAGE : 2490782 3', 



AI973037 

wr46g08.xl NCI_CGAP_Pr28 
mRNA sequence . 
AI973037 

AI973037.1 GI:5769863 
EST. 

Homo sapiens (human) 

Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Euarchontoglires ; Primates; Haplorrhini; 
Catarrhini; Hominidae; Homo. 
1 (bases 1 to 539) 

NCI-CGAP http : //www . ncbi . nlm. nih . gov/ncicgap . 

National Cancer Institute, Cancer Genome Anatomy Project (CGAP) , 
Tumor Gene Index 
Unpublished (1997) 

Contact: Robert Strausberg, Ph.D. 
Email: cgapbs-r@mail.nih.gov 

Tissue Procurement: Michael J. Brownstein, M.D., Ph.D., Michael R. 
Emmert-Buck, M.D., Ph.D. 

cDNA Library Preparation: M. Bento Scares, Ph.D. 

cDNA Library Arrayed by: Greg Lennon, Ph.D. 

DNA Sequencing by: Washington University Genome Sequencing Center 
Clone distribution: NCI-CGAP clone distribution information can be 
found through the I.M.A.G.E. Consort ium/LLNL at: 
www-bio . llnl . gov/bbrp/image/ image . html 
Insert Length: 768 Std Error: 0.00 
Seq primer: -40UP from Gibco 
High quality sequence stop: 458. 
Location/Qualifiers 
1. .539 

/organism="Homo sapiens" 

/mol_type="mRNA" 

/db_xref="taxon: 96 06" 

/clone=" IMAGE: 2 490 782" 

/sex="male" 

/dev_st age= " adult " 

/lab_host="DH10B" 

/ clone_l ib= " NC I_CGAP_Pr 2 8" 

/note="Organ: prostate; Vector: pT7T3D-PacI; Plasmid DNA 
from the normalized library NCI_CGAP_Pr22 was prepared, 
and ss circles were made in vitro. Following HAP 
purification, this DNA was used as tracer in a subtractive 
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hybridization reaction. The driver was PCR-amplif ied cDNAs 
from a pool of 5,000 clones made from the same library 
(clonelDs 985608-986759, 1101192-1101959, and 
1217928-1220615). Subtraction by Bento Scares and M. 
Fatima Bonaldo . " 



Query Match 60.8%; Score 536; DB 1; Length 539; 

Best Local Similarity 100.0%; Pred. No. 1.5e-120; 

Matches 536; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 345 TGAGACAGGGTCGTGCCCAGATGATGGAGAAATCGACCCAGAAGCCTGAGGAGGTGTCCT 40 4 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 53 9 TGAGACAGGGTCGTGCCCAGATGATGGAGAAATCGACCCAGAAGCCTGAGGAGGTGTCCT 48 0 

Qy 405 GGGTTTGGCTGGCTGGCTCCTGCTCCAGCGGCCCGGCTTCAGGTGTCCGGGGGCGTGGCT 46 4 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 479 GGGTTTGGCTGGCTGGCTCCTGCTCCAGCGGCCCGGCTTCAGGTGTCCGGGGGCGTGGCT 42 0 

Qy 465 GCCTGGAGCAGGTGTGCTGAATACCCTGGATGGGAACTGAGCGAACCCGGGCCTCCGCTC 52 4 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 419 GCCTGGAGCAGGTGTGCTGAATACCCTGGATGGGAACTGAGCGAACCCGGGCCTCCGCTC 36 0 

Qy 525 AGAGAGACGTGGCAGGACCAGCGAGGAATCCAGCCTGTCCACTTCCAGAACAGTGTTTCC 58 4 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 35 9 AGAGAGACGTGGCAGGACCAGCGAGGAATCCAGCCTGTCCACTTCCAGAACAGTGTTTCC 300 

Qy 585 CAGGCCCCGCTGAGTGGACCGGACCTCTGACACCTCCAGGTTCTTGCTGACTCCGGCCTG 6 44 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 299 CAGGCCCCGCTGAGTGGACCGGACCTCTGACACCTCCAGGTTCTTGCTGACTCCGGCCTG 2 40 

Qy 645 GTGAAAGGGAGCGCCATGGTCCTGGCTGTTGGGGTCCCAGGGAGAGGCTCTCTTCTGGAC 70 4 

M I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 239 GTGAAAGGGAGCGCCATGGTCCTGGCTGTTGGGGTCCCAGGGAGAGGCTCTCTTCTGGAC 180 

Qy 705 AAACACACCCTCCCAGCCCCCAGGGCTGTGCAAACACATGCCCCTGCCATAAGCACCAAC 76 4 

I I I I I I I I I I I I I I I I I I I M M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 179 AAACACACCCTCCCAGCCCCCAGGGCTGTGCAAACACATGCCCCTGCCATAAGCACCAAC 12 0 

Qy 765 AAGAACTTCTTGCAGGTGGAGTGGCTGTTTTTTATAAGTTGTTTTACAGATACGGAAACA 82 4 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 119 AAGAACTTCTTGCAGGTGGAGTGGCTGTTTTTTATAAGTTGTTTTACAGATACGGAAACA 6 0 

Qy 825 GTCCAAAATGGGATTTATAATTTCTTTTTTGCATTATAAATAAAGATCCTCTGTAA 880 

I I I I I I I M M I I I I I I I I I I I I I I I I I I I I I I I I M M I I I I I I I I I I I I I I I I I 
Db 59 GTCCAAAATGGGATTTATAATTTCTTTTTTGCATTATAAATAAAGATCCTCTGTAA 4 



RESULT 15 
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CB993691 

LOCUS CB993691 843 bp rtiRNA linear EST Ol-MAY-2003 

DEFINITION AGENCOURT_1362 4626 NIH_MGC_148 Homo sapiens cDNA clone 
IMAGE: 30338285 5', mRNA sequence. 

ACCESSION CB993691 

VERSION CB993691.1 GI:30288211 

KEYWORDS EST. 

SOURCE Homo sapiens (human) 

ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Euarchontoglires ; Primates; Haplorrhini; 
Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 843) 

AUTHORS NIH-MGC http://mgc.nci.nih.gov/. 

TITLE National Institutes of Health, Mammalian Gene Collection (MGC) 

JOURNAL Unpublished (1999) 
COMMENT Contact: Robert Strausberg, Ph.D. 

Email: cgapbs-r@mail.nih.gov 

Tissue Procurement: Dr. Stefan Hansson 

cDNA Library Preparation: Michael J. Brownstein (NHGRI) with help 
and advice from Piero Carninci (RIKEN) 
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 
DNA Sequencing by: Agencourt Bioscience Corporation 
Clone distribution: MGC clone distribution information can be 
found through the I.M.A.G.E. Consortium/LLNL at: 
http : / / image . llnl . gov 
Plate: NDAM36 4 row: 1 column: 06 
High quality sequence stop: 408. 
FEATURES Location/Qualifiers 
source 1. .843 

/organism="Homo sapiens" 

/ mo l_t ype= " mRNA " 

/db_xref="taxon: 96 06" 

/clone=" IMAGE: 30338285" 

/tissue_type="pre-eclamptic placenta" 

/lab_host="DH10B TonA" 

/clone_lib="NIH_MGC_148" 

/note="Organ : placenta; Vector: pBluescriptR; Site_l : 
all-XhoI; Site_2: BamH; Library is oligo-dT primed and 
directionally cloned using primer 

5 - _XTTTTTTTTTTTTTTTVN-3 ' , size-selected for average insert 
size 2.3 kb and normalized to ROT 5. This is a primary 
library enriched for full-lenght clones and constructed 
using the Cap-trapper method (Carninci, in preparation) . 
Library constructed by M. Brownstein (NIMH/NHGRI, 
National Institutes of Health). Note: this is a NIH_MGC 
Library . " 
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Query Match 60.2%; Score 530.2; DB 7; Length 8 43; 

Best Local Similarity 92.5%; Pred. No. 4.2e-119; 

Matches 580; Conservative 0; Mismatches 43; Indels 4; Gaps 2; 

Qy 6 ATGACCCCCGCTGCTCTGTCTTGCAGGCTCGTCGCCGCGGCCCCCCGAGCCCGACCGCCG 6 5 

I I M M M M I I M I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I 
Db 28 ATGACCCCCGCTGCTCTGTCTTGCAGGCTCGTCGCCGCGGCCCCCCGAGCCCGACCGCCG 8 7 

Qy 6 6 CCGCCACCACCACCAGCGCCCGGGCGGGCCTCGCGCGCCTCGGGCGCGGCTCCGCAGTGA 125 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 8 8 CCGCCACCACCACCAGCGCCCGGGCGGGCCTCGCGCGCCTCGGGCGCGGCTCCGCAGTGA 147 

Qy 126 GCCCACCAAGAAGGAAGCGGCCTGCAGAGGTGCCGACATGGGGCTTAAGATGTCCTGCCT 185 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 148 GCCCACCAAGAAGGAAGCGGCCTGCAGAGGTGCCGACATGGGGCTTAAGATGTCCTGCCT 20 7 

Qy 186 GAAAGGCTTTCAAATGTGTGTCAGCAGCAGCAGCAGCAGCCACGACGAGGCCCCCGTCCT 2 45 

I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 20 8 GAAAGGCTTTCAAATGTGTGTCAGCAGCAGCAGCAGCAGCCACGACGAGGCCCCCGTCCT 26 7 

Qy 246 GAACGACAAGCACCTGGACGTGCCCGACATCATCATCACGCCCCCCACCCCCACGGGCAT 3 05 

I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I M I I I I I I I I I I I I M I I I I I I I I I I I I 
Db 26 8 GAACGACAAGCACCTGGACGTGCCCGACATCATCATCACGCCCCCCACCCCCACGGGCAT 32 7 

Qy 306 GATGCTGCCGAGGGACTTGGGGAGCACAGTCTGGCTGGATGAGACAGGGTCGTGCCCAGA 36 5 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 328 GATGCTGCCGAGGGACTTGGGGAGCACAGTCTGGCTGGATGAGACAGGGTCGTGCCCAGA 38 7 

Qy 366 TGATGGAGAAATCGACCCAGAAGCCTGAGGAGGTGTCCTGGGTTTGGCTGGCTGGCTCCT 425 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 38 8 TGATGGAGAAATTGACCCAGAAGCCTGAGGAGGTGTCCTGGGTTTGGCCGGCTGGCTCCT 44 7 

Qy 426 GCTCCAGCGGCCCGGCTTCAGGTGTCCGGGGGCGTGGCTGCCTGGAGCAGGTGTGCTGAA 485 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 448 GCTCCAGCGGCTCGGCTTCAGGTGTCCGGCGGCGTGGCTGCCTGGACCAGGCGTGCTGGA 507 

Qy 486 TACCCTGGATGGGAACTGAGCGAACCCGGGCCTCCGCTCAGAGAGACGTGGCAGGACCAG 5 45 

M M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I 
Db 508 TACCCTGGATGGGAACTGAGCGAACCTGGGCCTCCGCTCATAGAGACGTGTCAGGGACGT 56 7 

Qy 546 CGAGGAATCCAG-CCTGTCCACTTCCAGAACAGTGTTTCCCAGG CCCCGCTGAGTGG 601 

I I I I I I I I I I I I I I I I I Mill I I I I I I I I III II I 

Db 56 8 GTAGGGATCCAGCCCTGTTCCCCTTTTAAACAGCTTTTCCCCGGTGCCCTCCTTTATGGA 62 7 

Qy 6 02 ACCGGACCTCTGACACCTCCAGGTTCT 62 8 

I I I I I I I I I I I I I I I III 
Db 628 CCGGAACCTCCGAAACCTCCCCGGTTT 654 
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